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ABSTRACT 

N.4>Gamma is a strand-specific and site-specific DNA 
nicking enzyme (YCG^GT or ACtCGR). Here we de- 
scribe the isolation of single and double mutants of 
N.4>Gamma with attenuated activity. The nicking do- 
mains (NDs) of E59A and 11 double mutants were 
fused to the 5mCG-binding domain of MBD2 and 
generated fusion enzymes that preferentially nick 
5mCG-modified DNA. The CG dinucleotide can be 
modified by C5 methyltransferases (MTases) such as 
M.SssI, M.Hhal or M.Hpall to create composite sites 
ACfYGG N (8 _ 15 ) 5mCG. We also constructed a fusion 
enzyme 2xMBD2-ND(N.BceSVIII) targeting more fre- 
quent composite sites AStYS N( 5 _ 12 ) 5mCG in Mn 2+ 
buffer. 5mCG-dependent nicking requires special di- 
gestion conditions in high salt (0.3 M KCI) or in Ni 2+ 
buffer. The fusion enzyme can be used to nick and 
label 5mCG-modified plasmid and genomic DNAs 
with fluorescently labeled Cy3-dUTP and potentially 
be useful for diagnostic applications, DNA sequenc- 
ing and optical mapping of epigenetic markers. The 
importance of the predicted catalytic residues D89, 
H90, N106 and H115 in N.t|>Gamma was confirmed 
by mutagenesis. We found that the wild-type enzyme 
N.4>Gamma prefers to nick 5mCG-modified DNA in 
Ni 2+ buffer even though the nicking activity is sub- 
optimal compared to the activity in Mg 2+ buffer. 

INTRODUCTION 

Restriction and modification systems are one of the bac- 
terial defense mechanisms against invading bacteriophage, 
transposable elements and conjugative plasmids (reviewed 
in (1,2)). Bacteria protect their chromosomal DNA by 
modifying their own sequences, for example, by adding a 
methyl group to C5 position of cytosine to generate 5- 
methylcytosine (5mC), resulting in resistance to cleavage 
by the cognate restriction endonuclease (REase). In con- 
trast, the Type IIM REases Dpnl and Type IV REases 



McrBC, Mrr and GmrSD cleave foreign DNAs only 
when they are modified (for example, 5mC, 5hmC (5- 
hydroxylmethylcytosine) or glucosyl-5hmC) (reviewed in 
(3)). In theory, nicking variants could be isolated from the 
modification-dependent REases, but no such variants have 
been successfully engineered. 

Strand-specific and site-specific nicking enzymes are 
found in nature as one subunit of heterodimeric restric- 
tion enzymes (e.g. Nb.BsrDI and Nt.BstNBI) (4,5) or as 
DNA nicking-modification systems (e.g. Nt.CviPII and 
Nt.CviQII) (6-8). In addition, a number of natural hom- 
ing endonucleases (HEases) such as I-Hmul and I-BasI rec- 
ognize and nick a degenerate 16- to 24-bp sequence (9). 
Nicking enzymes have also been engineered from Type IIS 
REases and HEases (10-12). Another group of sequence- 
specific nickases called relaxases is often found in conjuga- 
tive plasmids that are involved in initiation of DNA repli- 
cation and DNA strand transfer (13). In the past few years, 
zinc finger nickases as fusions of zinc finger protein and 
Fokl cleavage domains have been constructed to introduce 
sequence-specific nicks in genomic DNA for gene targeting 
(gene addition and gene deletion) (14,15). A nicking do- 
main (ND) variant of a DNA repair enzyme (MutH) has 
been fused to a TAL effector protein to create a rare nicking 
enzyme suitable for gene targeting (16). However, no robust 
5mC-specific nickases have been found in nature despite ex- 
tensive enzyme screening in the past 30 years (see REBASE) 
(17). 

We recently discovered a large group of HNH nicking en- 
zymes encoded by phage and prophage; these HNH nicking 
enzymes are located next to terminase small and large sub- 
units involved in DNA packaging (18). The HNH motif is a 
small nucleic acid binding and cleavage module with a single 
metal ion (usually a Zn ion) bound by tetrahedral coordina- 
tion (19-21). To date, more than 1000 proteins containing 
this motif have been discovered and they belong to differ- 
ent subgroups of enzymes such as HEases, bacterial toxins 
(colicins) with non-specific nuclease activity, modification- 
dependent endonuclease McrA, DNA repair enzymes, pu- 
tative helicases or reverse transcriptases. It has been pro- 
posed that the conserved His residue in the HNH motif 
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(HNK or HNN) activates a water molecule for a nucle- 
ophilic attack on the sugar phosphate backbone of nucleic 
acids by behaving like a general base (20,22,23). 

The aim of this study is engineering of 5mC-specific nick- 
ing endonucleases (NEases or nickases). DNA methyla- 
tion is an epigenetic process that is involved in the regu- 
lation of gene expression and cell differentiation, and the 
spectrum of methylation levels is broadly variable in mam- 
malian DNA (24). The change in methylation profile is cat- 
alyzed by a family of DNA methyltransferases (DNMT1, 
DNMT3a and DNMT3b) (reviewed in (25)) and also reg- 
ulated through DNA demethylation by the Tet family en- 
zymes and T/G mismatch repair enzyme TDG (26-28). 
Methylated cytosines (5mC) are recognized by the methyl- 
binding domain (MBD) proteins that mediate the recruit- 
ment of chromatin remodeling enzymes (24). It has been 
demonstrated that cancer cells have global alterations in 
chromatin structure and epigenetic abnormalities play a 
role in different steps of cancer, from initiation to sur- 
vival and invasion (reviewed in (29,30)). The analysis of 
the methylation profile can help to differentiate tumor cells 
from normal cells and is a useful tool for cancer diagnosis. 
Both hypo- and hyper-methylations are known to play a role 
in the onset of oncogenic diseases by either activating onco- 
genes or by silencing tumor suppressor genes (31-35). Thus, 
the engineering of a 5mC-dependent NEase has a practi- 
cal utility by combining C5 MTases with 3-8 bp recogni- 
tions and a DNA ND with 2-7 bp specificity. It has been 
reported that nicking enzymes can be used to nick and label 
DNA for nicking site profiling (optical mapping), genomic 
fragment assembly, and ligase-free DNA fragment assembly 
and cloning (36-38). 

The MBD protein family members share a methyl-CpG- 
binding domain (~80 amino acid (aa) long) that has a spe- 
cific affinity for methylated CpG sites (5mCG) in double- 
stranded DNA. The MBD2 of this family appears to bind 
with a relaxed specificity to methylated DNA at 5mCG. It 
has been demonstrated that the multimerization of 5mCG- 
binding domain of MBD2 displays higher binding affin- 
ity to 5mCG in vitro than the monomeric peptide (39). In 
this work, two contiguous methyl-CpG-binding domains of 
MBD2 were constructed and then fused to a DNA ND to 
create a 5mCG-specific NEase. 

The nicking enzyme N.4>Gamma, encoded by Bacillus 
anthracis phage 7, is a small protein with 127-aa residues 
and its minimal DNA ND of 76-aa has been fused to the 
zinc finger protein Zif268 to create a rare nickase (18). 
Structure prediction revealed that the 76-aa catalytic do- 
main contains a (3-(3-a structure (also known as (3(3a- 
metal structural fold). The sequence specificity of this full- 
length NEase is YCG^GT (complement strand: ACfCGR) 
(Y = C/T; R = A/G) in Mg 2+ buffer and is reduced to 
~3-bp recognition in Mn 2+ buffer (i.e. one base differ- 
ent from the ACfCG site, e.g. ACfTG or ATfCG) (the 
down arrow indicates the nicked strand and the up ar- 
row indicates the opposite strand is nicked). A homolog of 
N.4>Gamma, N.BceSVIII, encoded by a prophage of Bacil- 
lus cereus ATCC 10987, whose ND was also used in fusion 
with 2xMBD2. N.BceSVIII, nicks the DNA sites SS^RT 
(AYtSS) (S = G/C) in Mn 2+ buffer and it has poor nicking 



activity in Mg 2+ buffer. Thus, the DNA nicking activity is 
Mn 2+ -dependent(18). 

In this manuscript, we report the isolation of N.<j)Gamma 
variants with reduced nicking activity, and the construction 
of fusion nicking enzymes by tethering DNA NDs to the 
5mCG-binding domain of MBD2, and characterization of 
nicking activities in Mg 2+ , Mn 2+ and Ni 2+ buffers. In addi- 
tion, we demonstrated DNA labeling using the fusion en- 
zyme on 5mCG-modified DNA. 

MATERIALS AND METHODS 

Bacterial strains, culture media, cloning vector and DNA sub- 
strates 

The NEB Turbo Escherichia coli competent cells were used 
for cloning and the E. coli T7 Express competent cells (New 
England Biolabs, Inc. (NEB), Ipswich, MA, USA) were 
used for protein expression. All restriction and modification 
enzymes and DNA polymerases were from NEB. For pro- 
tein expression experiments, pTYBl (7477 bp) was used as a 
cloning and expression vector (NEB). The stop codon of the 
target gene was eliminated and the gene was inserted into 
Ndel and Xhol sites, which allows the expression of target 
protein as a fusion to an intein-CBD tag (in the C-terminus 
of the target protein). The intein cleavage is induced by ad- 
dition of 50 mM Dithiothreitol (DTT) which releases the 
target protein from the intein-CBD tag that remains bound 
to the chitin column. 

The pBR322 (4361 bp) was used as a DNA substrate 
for some nicking activity assays. A modified pUC19 plas- 
mid (2686 bp) containing N.cf)Gamma nicking sites and 
the methylation sites of M.Hhal was used. The plasmid 
substrates were first methylated by M.SssI (CG), M.Hhal 
(GCGC) or M.Hpall (CCGG) (4 h methylation reactions). 
The modified DNA substrates were then purified by spin 
columns (Qiagen, Valencia, CA) and subjected to nicking 
digestion and run-off sequencing using BigDye® direct cy- 
cle sequencing kit (ABI/Life Technologies, Grand Island, 
NY). To check the extent of 5mC modification of plas- 
mid DNAs, the M.Hhal-, M.Hpall- and M.Eagl-modified 
DNAs were digested by the cognate restriction enzymes 
Hhal, Hpall and EagI, respectively, to make sure that more 
than 95% of the modified DNAs were resistant to restriction 
digestion. For M.SssI-modified plasmid DNAs, we used 
Hpall digestion to examine the extent of modification (re- 
sistance). 

The nicking activity assays were performed in different 
buffers supplemented with metal ions as stated in each 
experiment. For screening of N.c|)Gamma variants, two 
buffers (10 mM MgCl 2 or 1 mM MnCl 2 with 50 mM NaCl 
and 10 mM Tris-HCl, pH 7.5) were used. These two buffer 
conditions were chosen because N4>Gamma behaves dif- 
ferently in Mg 2+ and Mn 2+ (15). The nicked DNA samples 
were analyzed by electrophoresis on a 0.8% agarose gel and 
stained with ethidium bromide. 

To enhance 5mCG-binding specificity and nicking, 0.25- 
0.3 M KC1 was supplemented in some nicking reactions 
with Mg 2+ or Mn 2+ . Alternatively, 5mCG-dependent nick- 
ing of modified plasmid was achieved in a nickel buffer (50- 
75 mM NaCl, Tris-HCl, pH 7.5, 0.1-1 mM Ni 2+ ). Five hun- 
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dred nanograms of plasmid DNA were digested by the fu- 
sion enzyme for 1 h at 37°C. The samples were treated with 
Proteinase K to release the DNA that was tightly bound to 
the protein and analyzed by gel electrophoresis. 

Cloning of nicking domain of N.cf>Gamma and 5mCG- 
binding domain of MBD2 

The synthetic gene encoding two consecutive 5mCG- 
binding domains of MBD2 (2xMBD2) with optimized 
E. coli expression codons were purchased from IDT (In- 
tegrated DNA Technologies, Coralville, IA) and sub- 
cloned into pTYBl (Ndel-Xhol fragment). The 2x 5mCG- 
binding domains of MBD2 (2xMBD2 for abbreviation) 
contain 235-aa residues and have the following aa sequence: 
(Ndel site at the N-terminal coding sequence) 
MRATESGKRM DCPALPPGWK KEEVIRKSGL 
SAGKSDVYYF SPSGKKFRSK PQLARYLGNA 
VDLSSFDFRT GKMMPSKLQK NKQRLRNDPL 
NQNKFRLIKK QTLIGL GASG ASGASGGS RA 
TESGKRMDCP ALPPGWKKEE VIRKSGLSAG 
KSDVYYFSPS GKKFRSKPQL ARYLGNAVDL 
SSFDFRTGKM MPSKLQKNKQ RLRNDPLNQN 
KFRLIKKQTL IGL GASGASG ASGAS 

(Nhel-BamHI-XhoI sites after the C-terminal coding se- 
quence; underlined aa sequence, linker between functional 
domains). 

The coding sequence for the minimal nicking domain 
(76-aa, ND(N.c|)Gamma), ND = nicking domain) was am- 
plified by polymerase chain reaction (PCR) and fused to 
2xMBD2 (ligation into Nhel and Xhol sites) or fusions 
were constructed by the Gibson assembly method (NEB). 

N.BceSVIII ND coding sequence (222 bp, 74-aa) was am- 
plified by PCR and inserted in the pTYBI plasmid con- 
taining 2xMBD2 (Nhel-Xhol fragment insert). The aa se- 
quence of the fusion enzyme is shown below: 

MRATESGKRM DCPALPPGWK KEEVIRKSGL 
SAGKSDVYYF SPSGKKFRSK PQLARYLGNA 
VDLSSFDFRT GKMMPSKLQK NKQRLRNDPL 
NQNKFRLIKK QTLIGL GASG ASGASGGS RA 
TESGKRMDCP ALPPGWKKEE VIRKSGLSAG 
KSDVYYFSPS GKKFRSKPQL ARYLGNAVDL 
SSFDFRTGKM MPSKLQKNKQ RLRNDPLNQN 
KFRLIKKQTL IGL GASGASG ASGAS rygra wkrirdsyia 
ahplceeckr qgkltpanev hhilplargg thdrsnlmal ctpchsaita 
rdgdrwgtr (the aa residues shown in lower case are the NDs 
of N.BceSVIII; underlined aa residues, linker sequence). 

The recombinant plasmids were transformed into E. coli 
T7 Express competent cells and plasmids were extracted by 
a mini-prep kit (Sigma-Aldrich, St Louis, USA). Plasmids 
with correct inserts were screened by restriction digestion 
and further verified by DNA sequencing. 

Site-directed mutagenesis 

Site -directed mutagenesis of N.c))Gamma was carried out by 
inverse PCR. The appropriate mutagenic primers were or- 
dered from IDT and mutagenesis was performed by 25 cy- 
cles of PCR (Phusion DNA polymerase, NEB). The tem- 
plate DNA was destroyed by 1 h digestion at 37° C with 
DpnI. The amplified DNA was purified by spin-column pu- 
rification (Qiagen), and then phosphorylated and ligated at 



room temperature with T4 polynucleotide kinase and T4 
DNA ligase. The ligated DNA was used in transformation. 
Mutant alleles were sequenced to confirm the desired mu- 
tation^). 

Protein expression and purification 

Late-log phase cells were induced with 0.5 mM IPTG 
and grown in a shaker for 4 h at 37°C for the 
N.ct>Gamma endonuclease and 5 h at 30°C for the 2xMBD2- 
ND(E59A or other variants of N.4>Gamma) and 2xMBD2- 
ND(BceSVIII) fusion proteins in Luria broth (LB) medium 
supplemented with ampicillin. Induced cells were harvested 
by centrifugation and stored at — 20°C. Small-scale purifica- 
tion was performed using 30 ml of IPTG-induced cells. Cell 
pellets were resuspended in 0.5 ml lysis buffer (0.5 M NaCl, 
20mMTris-HCl,pH8.5, 1 mM EDTA, 0. 1 % Triton X- 100) 
and then subjected to sonication on ice. The cell lysate was 
centrifuged to eliminate cell debris. Three hundred micro- 
liters of chitin beads (NEB) were dispensed into a 1.5 ml 
Eppendorf tube and washed twice with 1 ml column buffer 
(0.5 M NaCl, 20 mM Tris-HCl, pH 8.5). The cell extract 
was added to the resin and agitated for 1 h at 4°C. After 
this incubation, the resin was washed three times with 1 ml 
column buffer, and 600 julI of cleavage buffer (0.5 M NaCl, 
20 mM Tris-HCl, pH 8.5, 50 mM DTT) was added. DTT 
cleavage occurred overnight at 4°C. Then, the resin was gen- 
tly mixed and centrifuged and the supernatant containing 
the purified protein was removed for analysis. The proce- 
dure for chitin column purification was followed based on 
NEB's purification manual with a minor modification: the 
cell extract was loaded onto a 10 ml chitin column that was 
equilibrated with chitin column buffer and the flow-through 
was reloaded to the resin three times by gravity flow. After 
the column was washed 10 times with 10 ml column buffer, 
10 ml of cleavage buffer (0.5 M NaCl, 20 mM Tris-HCl, pH 
8.5, 50 mM DTT) were added and the intein cleavage took 
place overnight at 4°C. The target protein was then eluted 
and analyzed by sodium dodecyl sulphate-polyacrylamide 
gel electrophoresis (SDS-PAGE). 

A 5 ml HiTrap heparin HP column (GE Life Sciences) 
was used for second step purification of the fusion enzyme. 
The samples eluted from the chitin column were diluted in 
column buffer (20 mM Tris-HCl, pH 7.5, 1 mM DTT, 1 mM 
EDTA, 20 mM NaCl, 5% glycerol) to reach a final concen- 
tration of 50 mM NaCl. Proteins were loaded onto a hep- 
arin Sepharose column using an AKTA FPLC purification 
system (GE Life Sciences). Elution was carried out using a 
salt gradient of elution buffer (50 mM to 1 M NaCl, 20 mM 
Tris-HCl, pH 7.5, 1 mM DTT, 1 mM EDTA, 5% glycerol). 
The eluted fractions corresponding to UV absorption peaks 
were analyzed by SDS-PAGE. The fractions containing the 
desired protein were pooled, concentrated if necessary and 
stored at -20°C in 50% glycerol. 

Gel shift assays 

A DNA mobility shift assay in native gel was used to as- 
sess DNA binding activity of 2x 5mCG-binding domain of 
MBD2 and mutants of N.cf>Gamma (40). Varying amounts 
of protein were mixed with oligonucleotides in a binding 
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buffer (20 mM HEPES pH 7.9, 10% glycerol, 50 mM KC1, 
10 mM EDTA). After 30 min incubation at room tempera- 
ture, 4 |xl of glycerol was added and the mixture was loaded 
onto a 4-20% TB native gel (Life Technologies) and ran 45 
min at constant 200 V on ice (pre -ran gel for 30 min, gel 
box immersed in an ice-water box). The gel was stained with 
SYBR gold (Life Technologies) and gel imaging was carried 
out with 526 nm fluorescent emissions in the Typhoon 9400 
phosphoimager (GE Life Sciences). The gel images were an- 
alyzed using the ImageJ software. 



Nicking of 5mCG-containing duplex oligos 
(CGNi 4 ACCGGN 13 CG) 

Duplex oligos of 49 bp were synthesized based on one pre- 
ferred nicking site found in pBR322, which contains the fol- 
lowing sequence: 

5'GAGTGG CG ATAAGTGGTGTCTT ACfCGG 
GTTGGACTCAAGA CG ATAGTTA3' (top strand). 

5'TAACTAT CG TCTTGAGTCCAAC CCG|GT 
AAGACACCACTTAT CG CCACTC-FAM 3' (FAM- 
labeled bottom strand). 

Three CG sites in the duplex oligos were methylated by 
M.SssI in NEB buffer 2 plus SAM (4 h) and then puri- 
fied through Bio-Spin 6 spin columns (Bio-Rad, CA). The 
methylation status was confirmed by Hpall digestion as the 
modified site C5mCGG confers resistance to Hpall cleav- 
age. Duplex oligos (20-100 ng) were digested with the fu- 
sion nicking enzymes or WT N.cf>Gamma. Following pro- 
tease K treatment, the nicked product of FAM-labeled 24- 
nt oligo was boiled for 5 min and detected by fluorescence 
imaging after electrophoresis in 4-20% Tris-Borate-EDTA 
(TBE) gels. For preferential nicking of the modified oligos, 
a Ni 2+ buffer (50 mM NaCl, 10 mM Tris-HCl, pH 7.5, 0.5 
mM NiCl2) was used in the nicking reactions. 
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Figure 1. Nicking activity assay for mutants of N.cj>Gamma. (A) One mi- 
crogram of pBR322 was digested by purified enzyme (small scale) in a 
Mg 2+ buffer (NEB buffer 2). L = 2-log DNA ladder (NEB), WT = wild- 
type N.(j>Gamma. Single mutant aa substitution was indicated on top of 
each lane. NC: nicked circular; L: linear; SC: supercoiled. (B) Run-off se- 
quencing of plasmid DNA nicked by the full-length E59A nickase. Arrows 
indicate nicked site with double peaks at A/C. (C) Schematic diagram of 
DNA run-off sequencing to determine the nicked site. A doublet peak with 
extra A indicates the nicked site. The height of base-call peaks following 
the doublet drops off due to the nicked template. 



DNA labeling with Cy3-dUTP 

For labeling DNA by strand-displacement extension, 1 jxg 
of pBR322 unmethylated or M.SssI-methylated was di- 
gested by the 2xMBD2-ND(E59A of N.4>Gamma) fusion 
enzyme in 0.25 M KC1 for 1 h at 37°C. After 20 min heat- 
inactivation of the enzyme at 80° C, the samples were puri- 
fied by spin-column purification. Labeling was performed 
by mixing the purified DNA with 1 mM dTTP, dCTP and 
dATP (NEB) with 40 pM Cy3 dUTP (GE Life Sciences) and 
10 units of the Klenow fragment of E. coli DNA polymerase 
1 in 50 (jul total volume. The mixture was incubated 5-10 min 
at room temperature and then analyzed in an agarose gel 
without ethidium bromide. NIH 3T3 mouse genomic DN A 
was nicked by 2xMBD2-ND(E59A) in a Mg 2+ buffer con- 
taining 0.3 M KC1 for 1 h at 37°C. DNA labeling was per- 
formed through 10 min incubation at 25°C with Klenow 
fragment of E. coli DNA polymerase I and a dNTP mix 
containing Cy3-dUTP. Labeled DNA band images were ob- 
tained under 580 nm fluorescent emissions using Typhoon 
9400 phosphoimager (GE Life Sciences) and images were 
edited using the ImageJ software. 



RESULTS AND DISCUSSION 

Isolation of mutants of phage Gamma HNH nicking endonu- 
clease (N.<J>Gamma) 

The full-length phage Gamma HNH nicking endonucle- 
ase (N.c|)Gamma, 127-aa) and its minimal ND (76-aa) have 
been described previously (18). The nicking specificity of 
N.c|)Gamma is YCGJ^GT The 76-aa ND of N.c|>Gamma 
displays more degenerate specificities with one base off 
from the cognate site CGJ,GT under star conditions 
(high enzyme/glycerol concentration or digestion in Mn 2+ 
buffer). In this work, we isolated 54 single mutants and 3 
double mutants of N.(f>Gamma mainly targeting charged 
and hydrophilic aa residues or predicted catalytic residues. 
The goal of this mutagenesis experiment was to confirm the 
HNH catalytic residues and isolation of variants with atten- 
uated nicking activity (as a result of weakened binding). The 
mutant proteins were partially purified by batch purification 
with chitin beads and nicking activity tested on pBR322 
(Figure 1 and data not shown). Figure 1A shows the nick- 
ing activity of nine variants in Mg 2+ buffer. The mutants fall 
into two major groups. Group 1 variants show full or par- 
tial nicking activity in Mg 2+ buffer (e.g. E59A and Rl 17A). 
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Group 2 variants show poor nicking activity (e.g. N106A, 
H115A and K125A) or no activity (e.g. D89A and T119A) 
in Mg 2+ buffer. In Mn 2+ buffer, however, some poor activ- 
ity mutants such as H90A, N106A, HI 15A and Tl 19A still 
display partial nicking activities (data not shown). In order 
to eliminate the nicking activity completely in Mn 2+ buffer, 
three double mutants in the HNH motif (H90A/N106A, 
H90A/H115A and N106A/H115A) were constructed and 
the mutant proteins were purified and assayed for nicking 
activity. As expected, the double mutants had minimal de- 
tectable nicking activity in Mn 2+ buffer (data not shown). A 
number of active variants were further analyzed by run-off 
sequencing of the nicked sites. One active mutant, E59A, 
was found to nick the site CCG^GT preferentially. The 
range of nicking activity on ACCGN sites is as following: 
ACCGG>ACCGA>ACCGT>ACCGC as demonstrated 
by run-off sequencing of the nicked sites in pBR322 (Fig- 
ure IB). We choose to work on E59A for the following rea- 
sons: it is as active as the WT enzyme, E59A prefers to nick 
CCG4,GT (ACCGG) site, and the E59 to A substitution is 
located outside the HNH catalytic core. Later it was shown 
that the specificity of the 76-aa ND of E59A was confined 
to ACCGG or ACTGG sites when its ND was fused to a 
5mCG-binding domain (ACYGG, Y = C/T) (see below). 
We have not tested the nicking specificity of the stand-alone 
76-aa ND of E59A. A schematic diagram of DNA run-off 
sequencing is shown in Figure 1C. When Taq DNA poly- 
merase encounters a nick in the template, it adds an ade- 
nine base by the template-independent terminal transferase 
activity, thus creating a doublet peak (A/C). 

Table 1 summarizes the single mutant activities and pro- 
tein expression level. Relative nicking activity (+++, ++, 
+, +/— or — ) was estimated in Mg 2+ or Mn 2+ buffer 
in comparison with the WT enzyme. Most noticeably, 14 
out of 15 inactive and poor activity variants (+/— or — ) 
in Mg 2+ buffer are located in the 76-aa long C-terminal 
HNH ND with the predicted (3(3a-metal structure. The 
protein expression level of three variants (R62A, K77A 
and K81A) was much reduced, suggesting possible protein 
folding defect. A negatively charged residue D89 in front 
of the predicted catalytic residues H90-N106-H115 is con- 
served in many HNH nucleases and is critical for the cat- 
alytic activity. D89A lost nicking activity in both Mg 2+ 
and Mn 2+ buffers. Other poor activity or inactive mutants 
are K95A, Q108A, N116A, K118A, T119A, K123A and 
K124A, whose aa substitutions are located in the predicted 
(3(3a-metal fold. In structure-guided secondary structure 
prediction and structure modeling by the Phyre server in- 
dicated that N.4>Gamma is similar to a putative DNA re- 
pair enzyme Gme HNH endonuclease found in Geobacter 
metallireducens GS-15 (data not shown) (21). 

Engineering fusion nicking enzymes targeting 5mCG sites 

The 5mCG-binding domain of MBD2 is able to bind to 
modified 5mCG site. When the 5mCG-binding domain is 
duplicated or concatenated into three and four copies (2x, 
3x and 4x), they show higher binding affinity to 5mCG- 
modified DNA (39). Therefore, the fusion nicking enzymes 
we constructed carry two copies of 5mCG-binding domain 
of MBD2 (2xMBD2) tethered to a DNA nicking domain 



2x5mCpG binding domain (BD)--linker--phage y HNH nicking domain (ND) 



B. 




Figure 2. Schematic diagram of 2xMBD2-ND(N.<|)Gamma) functional 
domains and fusion enzyme purification. (A) A schematic diagram of the 
fusion protein 2xMBD2-ND(N.<|>Gamma). 5mCG BD, 5mCG-binding 
domain; 2x = duplicated 5mCG BD; ND, nicking domain. (B) SDS- 
PAGE analysis of partially purified fusion protein. The molecular mass of 
the fusion protein is predicted to be ~36 kDa. Lane 2, fusion protein puri- 
fied from a chitin column after DTT cleavage. Lanes 3-9, eluted fractions 
from a heparin Sepharose column. 



(ND, 76-aa) of N.4>Gamma with a 12-aa linker as dia- 
grammed in Figure 2A. From the mutagenesis experiment 
described above, we had obtained a small collection of 
N.c|)Gamma variants with full, partial or low nicking activ- 
ities. We have not constructed 3xMBD2 or 4xMBD2 in fu- 
sion with the HNH ND. Such fusion nicking enzymes may 
nick DNA at farther distance from the 5mCG-binding sites. 

A total of 16 fusion enzymes were constructed and par- 
tially purified by batch purification. Table 2 shows the rel- 
ative nicking activity in Mg 2+ buffer. In general, if the full- 
length mutant nickase is not very active in Mg 2+ buffer, 
the engineered fusion enzyme also has poor nicking activ- 
ity with the same metal cofactor (e.g. R62A, D89A, H90A, 
N106A, H115A and T119A in E59A background). That 
is, the nicking activity of these mutants cannot be rescued 
(suppressed) by fusion with another DNA binding part- 
ner (due to compromised catalytic residues or impaired 
DNA binding affinity of the ND or some unknown mech- 
anisms). The fusion enzymes are fully active (+++) or par- 
tially active (+ or ++) in Mg 2+ buffer, primarily depending 
on the 76-aa nicking variant used. However, when the ND 
of E59A/R68A, E59A/K95A and E59A/K123A was fused 
to a strong DNA binding partner 2xMBD2, partial nicking 
activity was restored (rescued). This suggests that the poor 
nicking activity of the full-length variants of R68A, K95A 
and K123A is likely due to attenuated DNA binding affin- 
ity. 

We first characterized the fusion enzyme 2xMBD2- 
ND(E59A of N.4>Gamma). Figure 2B shows the par- 
tially purified fusion enzyme after chromatography through 
chitin and heparin columns. Figure 3 shows the fusion en- 
zyme nicks both unmodified and M.SssI-modified pBR322 
(M.SssI modifies CG sites to yield 5mCG) in NEB buffer 2 
or in the presence of 0.1-0.3 M KC1 (Figure 3A and B, lanes 



e77 Nucleic Acids Research, 2014, Vol. 42, No. 9 PAGE 6 OF 16 



Table 1. Nicking activities and protein expression levels of the N.c|>Gamma variants 



Mutant 


Protein expression level 


Nicking activity in Mg 


Nicking activity in Mn 


WT 


+++ 


+++ 


+++ 


R6A 


++ 


+++ 


+++ 


H12A 


++ 


++ 


+++ 


N15A 


++ 


NT 


+++ 


R18A 


++ 


+++ 


+++ 


R20A 


++ 


++ 


+++ 


H25A 


+++ 


+++ 


+++ 


R26A 


+++ 


+++ 


+++ 


D35A 


++ 


+++ 


+++ 


R38A 


++ 


+ 


++ 


D41A 


++ 


+ 


++ 


R42A 


+++ 


+++ 


+++ 


R45A 


++ 




+ 


D46A 


++ 


+ + 


+++ 


R54A 


+ 


+ 


+++ 


S55A 


+++ 


+ 


+++ 


K56A 


++ 


+ 


+++ 


E59A 


++ 


++ 


+++ 


R60A 


+++ 


+++ 


+++ 


R62A 


+ 


+ 


+ 


E63A 


+ 


++ 


+++ 


Q64A 


+ 


+ 


+++ 


R68A 


+ 


+ 


+ 


D69A 


++ 


++ 


++ 


K70A 


++ 


+ 


+++ 


H75A 


+++ 


+ + 


+++ 


K77A 








N79A 


++ 


NT 


+++ 


R80A 


++ 


+++ 


+++ 


K81A 






+ 


D86A 


++ 


+++ 


+++ 


D89A 


+++ 




+ 


H90A 


++ 


/ 


+ 


K95A 


+ 




+ 


D97A 


++ 


+++ 


+++ 


K101A 


++ 


+ 


+++ 


E105A 


++ 


++ 


+++ 


N106A 


+ 


+ /- 


+ 


Q108A 


++ 


/ 


+ 


N112A 


+ 


+ /- 


++ 


H115A 


+ 


+ /- 


+ 


N116A 


+ 


+ /— 

/ 


++ 


R117A 


++ 


+ 


++ 


R117C 


+ 


+ + 


+++ 


R117G 


+ 


+ + 


+++ 


R117S 


++ 


++ 


+++ 


R117V 


+ 


+ 


++ 


K118A 


+ 






T119A 


+ 




+ 


E121A 


++ 


+ 


+++ 


D122A 


+ 


+ 


++ 


D122G 


+ 


+ 


++ 


K123A 


+ 


+/- 


++ 


K124A 


+ 


+/- 


++ 


K125A 


+ 


++ 


+++ 



The poor activity and inactive mutants (Mg buffer) are shown in bold; the putative catalytic mutants are shown in bold and underlined. Note: the activity 
of some inactive mutants (Mg 2+ ) can be rescued in Mn 2+ buffer. NT, not tested. Plasmid pBR322 was used as the nicking substrate. Buffers used in the 
nicking assay: 50 mM NaCl, 10 mM Tris-HCl, pH 7.5. 10 mM MgCb or 1 mM MnCl2. The relative mutant enzyme activities are defined as follows: +++ 
good activity, 50-100% of WT; ++ partial activity, 10-49% of WT; +/— or + poor activity, 1-9%; — no activity, background nicked DNA as mock-treated 
pBR322. 



3-5). The fusion enzyme was able to nick 5mCG-modified 
DNA preferentially in 0.25-0.3 M KC1 (lanes 6 and 7), 
which is consistent with the report that addition of KC1 
to MBD1 enhances 5mCG binding over non-specific DNA 
(41). The preferential nicking of 5mCG-modified DNA was 
also observed for other fusion enzymes carrying two aa 



substitutions in the ND (Figure 3C; e.g. R68A, N112A, 
K123A, K124A and Rl 17V in E59A background). Not sur- 
prisingly, the catalytic-deficient nicking mutant H90A was 
inactive when its ND was fused to 2xMBD2; the fusion en- 
zyme from Kl 18A ND was also inactive as the parent (full- 
length) nicking variant (Figure 3C and data not shown). It 
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Figure 3. Effect of KCI concentration on the nicking specificity of 2xMBD2-ND(E59A) fusion enzyme. (A) Nicking reaction carried out in Mg 2+ buffer 
supplemented with 0.1-0.3 M KCI. 0.5 u,g of pBR322 (unmodified) was digested by 2.5 |xg of the fusion enzyme in NEB buffer 2 (50 mM NaCl, 10 mM 
MgCh, 10 mM Tris-HCl, pH 7.8) plus different concentrations of KCI for 1 h at 37°C. NC: nicked circular; L: linear; SC: supercoiled. (B) Same digestion 
condition as in (A), except that 5mCG-modified pBR322 was used. (C) Nicking reaction carried out in NEB buffer 2 plus 0.3 M KCI with partially purified 
E59A and seven nicking domains of double mutants (in E59A background) with modified (M + ) and unmodified (M~) pBR322. (D) Characterization of 
the nicking site of 2xMBD2-ND(E59A) in Mg 2+ buffer. Run-off sequencing of modified and nicked pBR322. One of the composite nicking sites is (CG 
N6-14) ACfCGG N13 CG. The arrow indicates the nicking site with A/C doublet. Both strands were sequenced. (E) A time course of digestion in 10-60 
min with fixed amount of fusion enzyme and DNA. 



was concluded that the nicking activity of some mutants 
could be rescued by fusion with a strong DNA binding part- 
ner such as 2xMBD2 and nicking at the modified sites was 
strongly dependent on the presence of 0.25-0.3 M KCI. 

To find out the exact nicked sites, we performed run-off 
sequencing of the digested pBR322 (unmodified or 5mCG- 
modified) by 2xMBD2-ND(E59A). The whole plasmid was 
sequenced on both strands and 10 partially nicked sites were 
found. Table 3 lists 10 nicked sites in modified pBR322. 
Eight sites conform to ACfYGG N (8 _ 15) CG. The run-off 
sequencing of the one nicked site was shown in Figure 3D 



(this nicking site was also confirmed on modified synthetic 
duplex oligos, see below). A doublet peak (A/C) was ob- 
served when the template strand was nicked (Taq DNA 
polymerase adds an adenine base by template-independent 
nucleotide transferase activity when the template is broken). 
The nicking specificity of the full-length (127 aa) nicking 
enzyme variant E59A is ACCGN (N, G>A>T>C) but the 
specificity of the 76-aa ND of E59A appears to be more re- 
laxed in the third position as ACCGG or ACTGG when it 
is fused to 2xMBD2 (the fifth position recognition appears 
to be more specific for G only). The simultaneous effect 
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Table 2. DNA nicking activity of 2xMBD2-ND(N.<|>Gamma) fusion 
enzymes 



Nicking domain variant 


Nicking activity in Mg- + buffer 


E59A 


+++ 


F5QA + R £T) A 


-i- 


C J"rt ~ IvOOrt 


.(— f. 


FSQA 4- FTQOA 




F5QA + K'QSA 




FSQA 4. KM O^A 




E59A + Q108A 


+/- 


E59A + N112A 


++ 


E59A + H115A 




E59A + N116A 


+ + 


E59A + R117A 


+ + 


E59A + R117V 


+ + 


E59A + K118A 


+/- 


E59A + T119A 


+/- 


E59A + K123A 


+ 


E59A + K124A 


+ 



Relative nicking activity: +++, ++, +, +/— , — , see definition in Table 1 
footnote. 0.5 (jug of M.SssI-modified pBR322 DNA was digested by the 
fusion enzyme for 1 h at 37°C in a buffer containing 50 mM NaCl, 0.3 M 
KC1, 10 mM Tris-HCl, pH 7.5, 10 mM MgCl 2 . 



on recognition on two base pairs was not well understood 
and remained to be studied. It was concluded that the com- 
posite sites in modified pBR322 are confined to ACfYGG 
N( 8 _i5) CG. In order to limit the spacer of the composite site 
to a more defined length, it may be necessary to make the 
5mCG-binding domain smaller or make the linker shorter 
between the 2xMBD2 and the ND or use only one copy 
of 5mCG-binding domain with a trade-off for lower DNA 
binding affinity. 

A time course of nicking reaction from 10 to 60 min was 
carried out in order to define the enzyme units (Figure 3E). 
An extended time course from 10 to 90 min was also per- 
formed (data not shown). One unit of the fusion enzyme was 
defined as the amount of enzyme (~3 jxg) required to com- 
pletely nick 0.5 (xg of 5mCG-modified pBR322 DNA in 50 
jjlI of reaction volume at 37°C for 1 h in NEB buffer 2 sup- 
plemented with 0.3 M KC1. The specific activity is approx- 
imately 333 units/mg protein. Such a low specific activity 
suggests that the fusion enzyme does not turn over in nick- 
ing DNA probably due to the tight binding by 2xMBD2. 
The other indirect evidence came from the observation that 
the fusion enzymes remain bound to the DNA. Protease K 
digestion was necessary to 'release' the DNA after the nick- 
ing reactions. Otherwise some of the nicked DN A would be 
bound to the wells in agarose gels during gel electrophoresis 
as observed in Figure 3E. 

76-aa minimal DNA nicking domain of N.cf>Gamma 

The 76-aa ND of E59A (N4>Gamma) can be further min- 
imized to 74-aa by fusion to 2xMBD (by deletion of two 
more aa residues: FY). Deletion of three aa residues (FYR) 
from the 76-aa ND abolished the nicking activity (data not 
shown). A stretch of ~13-aa residues at the beginning of 
the 76-aa ND is predicted to form an a-helix which is im- 
portant for the enzyme activity; deletion of this putative ct- 
helix eliminated the nicking activity (18). A large number 
of homologs (20-100% aa sequence identity) of the 76-aa 
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DNA ND encoded by phage and prophage genomes can 
be found in GenBank by BlastP search (data not shown), 
which can be potentially used in engineering fusion nick- 
ases. A HK97 phage encoded HNH enzyme with non- 
specific nicking activity was reported previously and could 
be used as a non-specific ND with minimal specificity. Inter- 
estingly, the HNH endonuclease from cf>HK97 prefers Ni 2+ 
for nuclease activity (42). 

Fusion enzyme nicking activity in Mn 2+ buffer 

The YCG^GT (ACfCGR) nicking specificity of 
N.rfjGamma can be further relaxed into 3 bp with one 
base different from the CGGT (ACCG) sites when the 
nicking reactions are performed in Mn 2+ buffer (18). 
Accordingly, the fusion enzyme may also show degenerate 
composite nicking sites. The fusion enzyme was diluted 
2-, 4- and 8-fold and used in digestion of 5mCG-modified 
pBR322 in Mn 2+ buffer (supplemented with 0.3 M KC1) 
for 30, 45 and 60 min. The results are shown in Figure 4A. 
The 5mCG-modified DNA was preferentially nicked and a 
small amount of linear DNA was also detected. The nicked 
sites were analyzed by run-off sequencing. Figure 4B shows 
one of the partially nicked sites: (CG N 6 ) ACCC N n CG 
or (CG N 6 ) ACCCG Ni 0 CG. The 12 nicked sites in Mn 2+ 
buffer for the fusion enzyme can be summarized as ASYS 
N (5 _i2) CG (S = G/C) (see Table 3). In general, the nicking 
sites ASYS fall into CG-rich regions (2-5 copies of CG 
dinucleotide), suggesting that the density of the 5mCG sites 
is likely important for the nicking activity. The alternative 
explanation is for the fusion enzyme to dimerize and nick 
the target DNA or through a looping mechanism. 

DNA modification by M.Hhal to create composite nicking 
sites 

In theory, the CG dinucleotide can be modified by C5 
methyltransferases (MTases) with 3-8 bp DNA recognition 
sequence. The combination of C5 MTase modification to 
create 5mCG and HNH site ACYGG nearby would gener- 
ate composite rare nicking sites. A different strategy of us- 
ing DNA MTase, REase and Lac repressor in the Achilles' 
heel cleavage procedure was used to cleave yeast and E. coli 
genomes at a single site. 

M.Hhal modifies the GCGC site to produce G5mCGC 
(see REBASE). A pUC19 derivative was modified by 
M.Hhal and then nicked by the fusion enzyme in 0.3 
M KC1. The M.Hhal-modified pUC19 derivative was re- 
sistant to Hhal digestion (data not shown). Figure 5A 
shows that the modified DNA was preferentially nicked 
by the fusion enzyme. Run-off sequencing of the nicked 
DNA confirmed that a specific nick (a doublet peak A/T 
indicated by an arrow) was introduced in the compos- 
ite site ACjTGG N M GCGC, which is equivalent to 
a 9-bp nicking site (Figure 5B). Other nicked compos- 
ite sites are ACfCGGataagGCGC (ACCGG N 6 CG) 
and ACfCGGatacctgtcCGC (ACCGG N 9 CG) (data not 
shown). C CG C is a possible star site for M.Hhal that could 
be modified at high enzyme concentration and prolonged in 
vitro modification (4 h). 

We also modified pBR322 with M.Hpall and the mod- 
ified DNA was subjected to nicking by the fusion enzyme. 
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Table 3. Nicking sites of 5mCG-modified pBR322 in Mg 2+ and Mn 2+ buffers as determined by DNA run-off sequencing 



Type of modified DNA 



Nicking sites in Mg 



2+ 



Nicking sites in Mn 



2+ 



M.SssI-modified pBR322 

Same 

Same 

Same 

Same 



Same 



Same 



Same 



M.Hpall-modified pBR322 
M.Hhal-modified pUC19 
M.Hhal-modified pUC19 



ACCGG N13 CG: 1 site 
ACCGG Nn CG: 1 site 
ACTGG N 10 CG: 2 sites 
ACTGG N 9 CG: 1 site 
ACCGG N 6 CG or 
ACCGG Nn CG*: 1 site 

ACCGG Ni CG or 
ACCGG N 4 CG or 

ACCGG Nn CG**: 1 site 



ACCGG Nn CG: 1 site 
ACCGG N 8 CG: 1 site 
ACTGG N14 CG: 1 site 
ACYGG N( 8 -i5) CG 



ACCG N12 CG: 1 site 
ACTG Ni 2 CG: 1 site 
ACCG Nn CG: 1 site 
ACCCNn CG: 1 site 
ACTG Nn CG: 1 site 
AGTGNn CG: 1 site 

ACCG Nio CG: 1 site 



AGCG N 7 CG: 1 site 

ACCG N 5 CG: 2 sites 

AGCG N 5 CG or 

AGCG N 8 CG: 1 site 

ACGG N 6 CG: 1 site 

Not tested 
Not tested 
Not tested 
ASYS N ( 5-i2) CG 



ACfCGGataaggCGcagCG. 
"ACt CGGgCGgCGgccaaagCG. 



JO mimrtei 45 mintrtw 1 ioui 

1,7 IH l/l ):.' 1/4 : : 



m 
-i 

-sc 



B. 



■■nil).*l(21»MI -• 

M.SssI-modified 
forward strand 

Unmodified 
forward strand 



(CGN 6 ) ACCCN U CG 



^ C 1 T ♦ A T TC CTCC 7CT T T G A 6 C ^ CCGCTAGGCTG G 6 6 C T T 



■ i : . : ■ -C C' t(-{-C<T:CiCC>(CCCC(TlC((T((C«CCCTT 



Figure 4. Nicking activity assay for 2xMBD2-ND(E59A) fusion enzyme for methylated DNA in Mn 2+ buffer. (A) 0.5 u.g of 5mCG-modified pBR322 was 
digested by different amounts of enzyme (0.3-1.3 jig) in a buffer containing 10 mM MnCl 2 , 10 mM Tris-HCl, pH 7.8, 50 mM NCI and 0.3 M KC1 in 30 
min, 45 min and 1 h at 37°C. M+, 5mCG-modified substrate; M— , unmodified substrate. (B) Run-off sequencing of the pre-nicked modified DNA. The 
composite nicking site is (CGN6-17) ACfCC Nn CG. Arrow indicates the nicked site with A/C doublet. 



e77 Nucleic Acids Research, 2014, Vol. 42, No. 9 



Page 10 of 16 



M* (M.Hhal) 




+ Fusion nickase 



B. »ii-rai.»i«s>Nii -i 

M.Hhal-modified 
reverse strand 



mturm.<mnxai 
M.Hhal-modified 
forward strand 



^_ 



. ^ ■ i • l I I I i I ! ) t l -• • J ( I i > i. j „ _ ■ 

t n t : i n i i ii ( i n::: t t t i i [ i m i i 



( t c i t : t i t c c c i 



| ACTGG N 14 GCGC 

Figure 5. Nicking activity assay of 2xMBD2-ND(E59A) using a pUC19 derivative. (A) One microgram of M.Hhal-modified or unmodified pUC19 was 
digested by 2.5 u,g of the fusion nicking enzyme in NEB buffer 2 supplemented with 0.3 M KC1 for 2 h at 37°C. M.Hhal modifies the site GCGC to produce 
G 5mCG C, which subsequently serves as the substrate for the 5mCG-binding domain. NC: nicked circular; SC: supercoiled. (B) Run-off sequencing of 
nicked DNA modified by M.Hhal. The composite site ACTGG N14 G CG C is shown. Arrow indicates the nicking site with A/T doublet. No nick was 
found in sequencing of the opposite strand (top panel). 



The nicked DNA was analyzed by run-off sequencing. One 
nicking site was found at CCGG N 4 ACfCGG N 10 GCGG 
(data not shown). In this region, two Hpall sites and one 
star site (GCGG) are present and are likely modified by 
M.Hpall. It is known that M.Hpall modified the star site 
GCGG at high enzyme concentration (M. Zhang and R. 
Roberts, unpublished results). It is tentatively concluded 
that M.Hpall can be used to create composite sites con- 
taining 5mCG. In a control experiment, a pUC19 deriva- 
tive with a single EagI site was modified in vivo by M.EagI 
co-expression. M.EagI modifies the site CGGCCG to gen- 
erate CGG5mCCG (REBASE). The modified plasmid was 
extracted and digested by EagI, Ndel or the fusion nicking 
enzyme. The modified and unmodified plasmid DNAs were 
poorly nicked by the fusion enzyme in 0.3 M KC1, indicating 
that 5mCC dinucleotide is not a good substrate; the mod- 
ified plasmid is resistant to EagI digestion and sensitive to 
Ndel digestion (data not shown). 

Fusion enzyme activity in other divalent cations 

One fusion enzyme 2xMBD2-ND(E59A/T119A) has poor 
nicking activity in Mg 2+ buffer. Since the activity of some 
of the full-length mutants of N.cf>Gamma can be rescued by 
the presence of Mn 2+ cofactor, we tested the fusion enzyme 
activity in Mn 2+ buffer and in other metal ions. Figure 6 
shows that 2xMBD2-ND(E59A/Tl 19A) possesses 5mCG- 
dependent DNA nicking activity in Mn 2+ buffer, no activ- 
ity in buffers with Mg 2+ , Ca 2+ or Co 2+ (the enzyme and 
DNA formed a brown precipitate in Co 2+ buffer, data not 
shown), and a low but detectable nicking activity in 1 mM 



Ni 2+ (these reactions were performed in the absence of 0.3 
M KC1). It was noted during the mutagenesis experiment 
that the full-length nicking variant T119A displays poor 
activity in Mg 2+ buffer but the activity can be rescued by 
the presence of Mn 2+ (see Table 1). It was concluded that 
the fusion enzyme 2xMBD2-ND(E59A/T119A) displays 
5mCG-dependent DNA nicking activity in Mn 2+ and Ni 2+ 
buffers. Another fusion enzyme 2xMBD2-ND (E59/R62A) 
also displays preferential nicking of modified DNA in nickel 
buffer (1 mM Ni 2+ ). Nickel ions may enhance 5mCG nick- 
ing specificity by binding to the HNH ND (see more Ni 2+ - 
dependent nicking below). It is well known that divalent 
cations can alter restriction enzyme specificities. For ex- 
ample, Mg 2+ ions enhance EcoRV binding specificity to 
cognate sites (43). Ca 2+ ions can enhance Kpnl binding 
specificity to cognate sites (but Mg 2+ and Mn 2+ promote 
star site binding and cleavage). Kpnl belongs to the HNH- 
family endonucleases and high-fidelity (HF) mutants of 
Kpnl have been isolated that increased specificity in Mg 2+ 
buffer (reduced star activity) and altered metal ion require- 
ment; KpnI-HF displays no activity in Ca 2+ buffer and 
high-fidelity in Mg 2+ buffer (44). For the HpyAV restriction 
enzyme (also belonging to the HNH endonuclease family), 
however, Ni 2+ is preferred for binding and cleavage of the 
cognate sites (45). Another HNH endonuclease Sco JVlcrA 
requires Mn 2+ ions for efficient binding and cleavage of 
5mC-modified DNA (46). The HNH endonuclease gp74 of 
phage HK97, however, prefers Ni 2+ for nicking plasmid and 
phage DNAs although the exact nicking sites of gp74 have 
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Figure 6. Nicking activity assay in buffers containing different metal ions for the fusion enzymes 2xMBD2-ND(E59A/R62A) and 2xMBD2- 
ND(E59A/T119A) (nicking domain double mutants). Methylated pBR322 (0.5 u,g) by M.SssI (M) or unmodified (U) were digested by 2.5 u.g of the 
fusion enzyme in buffers containing 10 mM Tris-HCl, pH 7.5, 50 mM NaCl and supplemented with different metal ions for 1 h at 37°C. The metal ions 
and concentration were indicated on top of each lane. Nicking activity was detected in Mg 2+ , Mn 2+ and Ni 2+ buffers for 2xMBD2-ND(E59A/R62A); 
nicking activity was detected in Mn 2+ and Ni 2+ buffers for 2xMBD2-ND(E59A/Tl 19A). These digestions did not include 0.3 M KC1. 



not been determined; gp74 of phage HK97 displays poor 
endonuclease activity in Mg 2+ buffer (42,47). 

Fusion of N.BceS VIII minimal nicking domain to 2xMBD2 

The nicking specificity of the wild-type N.BceSVIII is 
SS^RT (complement strand: AYfSS) in Mn 2+ buffer and it 
has poor nicking activity in Mg 2+ buffer (18). By pair-wise 
aa sequence alignment of N.cf>Gamma and N.BceSVIII, the 
minimal DNA ND of 74-aa was predicted for N.BceSVIII 
(data not shown). The DNA coding sequence for the 74- 
aa ND was amplified by PCR and ligated to pTYBl- 
2xMBD2 (a 10-aa linker was engineered between 2xMBD 
and the ND). The fusion enzyme was partially purified 
by chromatography through chitin and heparin columns 
(Figure 7A). As expected, the fusion enzyme 2xMBD2- 
ND(N.BceSVIII) nicks DNA poorly in Mg 2+ buffer (Fig- 
ure 7B). In the presence of 0.3 M KC1 and 10 mM Mn 2+ , 
however, the fusion enzyme shows preferential nicking of 
5mCG-modified DNA. Some linear DNA was also present 
after the digestion that may result from two nicks in close 
proximity on both strands. Figure 7C shows one nicked site 
in run-off sequencing analysis, indicating the composite site 
as (CGNi) ACCC N 5 CG or (CGN0 ACCCG N 4 CG (a 
total of five CG sites are present in close proximity). 

Nicking of duplex oligos with the composite nicking sites CG 
N M ACCGG N 13 CG 

FAM-labeled synthetic oligos with the sequence N 6 CG N14 
ACCGG N B CG N7 was modified by M.SssI and used as 
the substrate for nicking in Mg 2+ and Ni 2+ buffers. The fu- 
sion nicking enzymes 2xMBD2-ND(E59A) and 2xMBD2- 
ND(E59A/R62A) nick modified oligos slightly better in 
Mg 2+ buffer supplemented with 0.3 M KC1 than unmodi- 
fied oligos (data not shown). In Ni 2+ buffer, however, nick- 
ing of modified oligos was significantly pronounced than 
unmodified oligos. Figure 8A and B shows the nicking 
activity of 2xMBD2-ND(E59A/R62A) on modified (M + ) 
and unmodified (M~) duplex oligos in the presence of 0.5 



mM Ni 2+ in a time course (10-60 min, lanes 1-6). The 
WT enzyme (full-length) N.c|)Gamma nicks the oligos more 
efficiently in Mg 2+ buffer than in Ni 2+ buffer regardless 
of methylation status (Figure 8A, B, lanes 7 and 8). In 
the control experiment to test modification level, major- 
ity of the modified oligos were resistant to Hpall digestion 
(Figure 8A, lane 10); and the unmodified oligos were di- 
gested by Hpall restriction enzyme (Figure 8B, lane 10). 
Furthermore, in the WT enzyme titration in nicking of 
modified and unmodified oligos in Ni 2+ buffer, the same 
result of preferential nicking of M + oligos over M~ oli- 
gos was obtained, i.e. WT NchGamma itself prefers to 
nick 5mCG-modified DNA in Ni 2+ buffer, a very interest- 
ing and unexpected discovery (Figure 8C). Some double- 
stranded cleavage products were also present (Figure 8A, 
lane 7; Figure 8C, lanes 2 and 3), suggesting different 
catalytic mechanisms in the presence of Mg 2+ and Ni 2+ 
ions. Figure 8D (lanes 1-8) shows the enzyme titration (2- 
fold serial dilution) of 2xMBD2-ND(E59A/R62A) in nick- 
ing of modified oligos in Ni 2+ buffer; the nicked product 
is proportional to the enzyme concentration. Figure 8E 
shows that 2xMBD2-ND(E59A/R62A) is active in nick- 
ing M + oligos in 0.1-5 mM Ni 2+ . Chelating the divalent 
cation by EDTA completely inhibited the nicking activ- 
ity (lane 7, digestion carried out in 10 mM ETDA), while 
WT N.cf)Gamma (full-length) is active in the presence of 
Mg 2+ /Ni 2+ (lane 8) or Mg 2+ alone (lane 9). In a control 
for dsDNA cleavage, unmodified oligos were cleaved by 
Hpall (lane 10). No double-stranded DNA cleavage activ- 
ity was detected by digestion with fusion enzyme 2xMBD2- 
ND(E59A/R62A) on modified oligos in the time period 
tested (30-60 min), although we cannot rule out the pos- 
sibility that dsDNA breaks may take place in much longer 
digestion. It was concluded that WT Ncj>Gamma prefers 
to nick/cleave modified DNA in Ni 2+ buffer. The fu- 
sion enzyme 2xMBD2-ND(E59A/R62A) also prefers to 
nick 5mCG-modified DNA, but lacks dsDNA cleavage ac- 
tivity in Ni 2+ buffer. Three other fusion enzymes tested 
(E59A/R68A, E59A/T119A and E59A/K123A) also dis- 
played 5mCG-dependent nicking in Ni 2+ buffer (data not 
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Figure 7. Purification of 2xMBD2-ND(N.BceSVIII) fusion enzyme and nicking activity assay. (A) SDS-PAGE analysis of purified fusion enzyme 
2xMBD2-ND(N.BceSVIII). The molecular mass of the fusion enzyme is predicted to be ~34.6 kDa. (B) Methylated pBR322 DNA (0.5 u.g) by M.SssI 
(M) or unmodified (U) were digested by 2.5 (jig of the fusion enzyme in buffers containing 10 mM Tris-HCl, pH 7.8, 50 mM of NaCl supplemented with 
10 mM Mn 2+ or 10 mM Mg 2+ and 0.1-0.3 M KCI for 1 h at 37°C. NC: nicked circular; L: linear; SC: supercoiled; U: unmodified DNA; M: methylated 
DNA. (C) Run-off sequencing of modified and nicked DNA in Mn 2+ buffer with 0.3 M KCI. The composite site (CGNi) ACCC N (5 _ 17 ) CG is shown. 
Five CG sites are present in this region. 



shown). Further structural and biochemical studies are 
needed to elucidate the mechanism of 5mCG-dependent 
endonuclease activity of WT N4>Gamma and its variant 
E59A/R62A in buffers with different metal ions. It was 
found previously that gp74 of phage HK97 nicks plasmid 
DNA in Ni 2+ buffer (0.5-10 mM), but prolonged digestion 
of X DNA in Ni 2+ or Zn 2+ buffers for 2-4 h can completely 
collapse dsDNA into a smearing pattern, possibly due to 
introduction of multiple nicks in close proximity (47). 

Labeling 5mCG-modified DNA after nicking 

DNA synthesis and labeling with fluorescently labeled 
dNTP can be performed from the nicks on dsDNA (36). 
M.Sssl-modified pBR322 was first nicked by the fusion 
enzyme 2xMBD2-ND(E59A). Then, strand-displacement 
synthesis (DNA primer extension) from the nicks was per- 
formed by Klenow fragment of E. coli DNA polymerase 
I (or BstI DNA polymerase I large fragment) with dATP, 



dCTP, dGTP and Cy3-dUTR The labeled DNA was re- 
solved by agarose gel and detected by fluorescence imag- 
ing. Figure 9A shows that the methylated DNA was pref- 
erentially nicked and labeled by Cy3-dUMP incorporation 
into the nascent DNA. The nicked DNA was able to be la- 
beled with Bst DNA pol I large fragment and Cy3-dUTP as 
well (data not shown). The unmodified DNA was labeled 
with a lower fluorescence signal probably due to nicks in- 
troduced in other ACYGG sites. Overall, the difference in 
fluorescent signal was significant. When the chromatogram 
was scanned and digitized, the signal ratio is 9:1 for mod- 
ified versus unmodified DNA. We also achieved nicking 
and labeling for mouse genomic DNA since the DNA was 
naturally modified by DNMT1, DNMT3a and DNMT3b 
in the cells. But the background DNA labeling was more 
pronounced, probably due to the random shearing/nicking 
during genomic DNA preparation (Figure 9B). In a nega- 
tive control, X. DNA (unmodified at CG sites, modified at 
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Figure 8. Digestion 5mCG-modified and unmodified duplex oligos in Ni 2+ buffer by the fusion enzyme 2xMBD2-ND(E59A/R62A), or in Mg 2+ and 
N? + buffers by the WT nicking enzyme N.<|>Gamma (full-length). (A) Digestion of M + oligos by the fusion enzyme 2xMBD2-ND(E59A/R62A) in Ni 2 
buffer, by WT N.<f>Gamma in Ni 2+ or Mg 2+ buffer, and by Hpall restriction enzyme in buffer 4, respectively. Arrow indicates nicked product and dashed 
arrow indicates dsDNA cleavage product. (B) Digestion of unmodified (M~) oligos by the fusion enzyme 2xMBD2-ND(E59A/R62A) in Ni 2+ buffer, by 
WT N.<|)Gamma in Ni 2+ or Mg 2+ buffer, and by Hpall restriction enzyme in buffer 4, respectively. (C) Digestion of M + and M~ oligos by the WT enzyme 
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(0.1-5 mM) at 37°C for 30 min. Lane 7 digestion included 1 mM Ni 2+ and 10 mM EDTA. Lane 10, unmodified oligos cleaved by Hpall restriction enzyme. 



CCWGG and GATC by Dcm and Dam methylases, respec- 
tively) was nicked by the fusion enzyme and subjected to 
primer extension/labeling. The nicked \ DNA gave rise to 
similar level of labeling compared to untreated DNA. In 
principle, the Cy3-dUMP-incoporated genomic DNA can 
be stretched in microfluidic-channels and be visualized at 
the labeled composite sites under fluorescence microscope, a 
process of optical imaging and mapping of epigenetic marks 
on normal and cancer DNAs. 



Fusion of nicking domain of N.<j>Gamma to SRA domain 
and Zinc finger protein with 5mC preference, and fusion of 
2xMBD2 to Fokl cleavage domain 

We also constructed a fusion enzyme between the SRA 
domain and the ND E59A (NchGamma). The SRA do- 
main (SET and RING associated (SRA) domain of mouse 
UHRF1) was reported to preferentially bind to hemi- 
methylated DNA (48). The fusion nickase was affinity- 
purified from a chitin column and used to nick hemi- 



methylated and fully methylated duplex oligos. We found 
that the fusion enzyme SRA-ND(E59A of N.ct>Gamma) was 
able to nick both hemi-methylated and fully methylated 
DNA (data not shown). A digestion condition for prefer- 
ential nicking of hemi-methylated DNA remains to be op- 
timized. 

A Zinc finger protein, Zfp57, is able to bind methylated 
site TGC5mCGC that contains a 5mCG dinucleotide (49). 
The fusion of Zfp57 DNA binding domain with the NDs of 
NchGamma or N.BceSVIII or gp74 of phage HK97 can po- 
tentially generate 5mC-dependent nickases with rare speci- 
ficities (6+4.5 = 10.5 bp). TAL effector proteins that can 
bind to methylated sites have been published previously 
(50). But construction of fusion nickases targeting methy- 
lated sites remains to be carried out. 

Another strategy to construct 5mCG-dependent nick- 
ing enzyme is to use a catalytic-deficient Fokl cleavage do- 
main in fusion with 5mCG-binding domain proteins (51). 
Such fusion protein can bind to 5mCG sites, but it can- 
not cleave or nick. Fokl mutants that can form transient 
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Figure 9. Labeling of nicked plasmid and genomic DNAs by strand- 
displacement DNA synthesis. (A) One microgram of pBR322 methylated 
by M.SssI (M+) or unmodified (M— ) was nicked by 2.5 (jig of 2xMBD2- 
ND(E59A) in Mg 2+ buffer containing 0.25 M KC1 for 1 h at 37°C. The 
labeling was performed through 5 min incubation at 25°C with 10 units 
of Klenow fragment of E. coli DNA polymerase I and a dNTP mix con- 
taining Cy3-dUTP. (B) NIH 3T3 mouse genomic DNA was nicked by 2.5 
(jig of 2xMBD2-ND(E59A) in a Mg 2+ buffer containing 0.3 M KC1 for 
1 h at 37°C. DNA labeling was performed through 10 min incubation at 
25°C with 10 units of Klenow fragment of £ coli DNA polymerase I and 
a dNTP mix containing Cy3-dUTP. The control enzyme Nt.BspQI nicks 
the DNA sites GCTCTTCNJ.N. 



dimer and produce strand -specific nicks have been con- 
structed (12). A catalytic -proficient, dimerization-proficient 
and DNA binding-deficient full-length Fokl mutant can 
be added and rescue the activity of MBD-FokIR* (cleav- 
age domain R*: catalytic-deficient) by formation of a tran- 
sient dimer complex: MBD-FokIR*/FokI full-length mu- 
tant R + B~ (cleavage-proficient, binding-deficient). Such 
engineered enzyme complex may produce 5mCG-specific 
nicks on DNA. Alternatively, one can engineer a single 
polypeptide of FokIR* and FokIR" 1 " by fusion of catalytic- 
deficient and catalytic-proficient cleavage domains. The 
forced FokIR*/R + dimer can be subsequently fused to 
5mCG-binding domain or 5mC-binding ZFP to create 
5mC-specific nicking enzymes. We had constructed a fu- 
sion protein of 2xMBD2-FokI* (catalytic-deficient), but 
the fusion protein was poorly expressed and only small 
amount of protein can be purified from 2L of IPTG- 
induced cells. Optimized expression condition and pu- 
rification method remains to be discovered. In theory, 
other DNA cleavage/NDs from DNA endonucleases, nick- 
ases, integrases, recombinases, transposases, HEases and 
DNA repair enzymes can be fused to the 5mCG-binding 
domain of MBD proteins and generate 5mCG-specific 
cleavage/nicking enzymes. 



CONCLUSION 

We isolated 54 single mutants and 3 double mutants of 
N.cf>Gamma mainly targeting charged and hydrophilic aa 
residues or predicted catalytic residues. The importance 
of the predicted catalytic residues D89, H90, N106 and 
HI 15 was confirmed by mutagenesis. The catalytic mutants 
D89A, H90A, N106A and H115A are inactive in Mg 2+ 
buffer, but the activity of H90A, N106A and H115A can 
be partially restored in Mn 2+ buffer. Double mutants of 
catalytic residues of H90, N106 and HI 15 are inactive in 
Mn 2+ buffer, indicating the H90, N 106, and HI 1 5 and other 
nearby residues can play an accessory role in Mn 2+ metal 
ion binding and activation of water molecule. The NDs 
of E59A and some double mutants were fused to 5mCG- 
binding domain of MBD2 and generated fusion enzymes 
that preferentially nick 5mCG-modified DNA. The CG 
dinucleotide can be modified by C5 MTases such as M.SssI, 
M.Hhal or M.Hpall to create composite sites (CG Nx) 
ACYGG N (8 -i5) CG. Other C5 MTases with 5-8 bp speci- 
ficity can potentially be used to create rare composite nick- 
ing sites. We found that WT N.c[>Gamma and fusion en- 
zymes with ND derived from N.4>Gamma prefer to nick 
5mCG-modified oligos in Ni 2+ buffer despite sub-optimal 
nicking condition. We also constructed a fusion nicking en- 
zyme 2xMBD2-ND(N.BceSVIII) targeting more frequent 
composite nicking sites in Mn 2+ buffer. The fusion enzymes 
can be used to nick and label 5mCG-modified DNA with 
fluorescently labeled Cy3-dUTP and potentially be useful 
for DNA optical mapping and DNA sequencing since a 
doublet peak would appear at the nicking sites near 5mCG. 
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